Abstract 

The notion of recoverable value was advocated in work of Feige, Im- 
morlica, Mirrokni and Nazerzadeh [Approx 2009] as a measure of qual- 
ity for approximation algorithms. There this concept was applied to fa- 
cility location problems. In the current work we apply a similar frame- 
work to the maximum independent set problem (MIS). We say that an 
approximation algorithm has recoverable value p if, for every graph, it 
recovers an independent set of size at least $\max_I \sum_{v \in I} \min[1, p/(d(v) + 1)]$, 
where $d(v)$ is the degree of vertex v, and I ranges over all independent 
sets in G. Hence, in a sense, from every vertex v in the maximum 
independent set the algorithm recovers a value of at least $p/(d(v) + 1)$ 
towards the solution. This quality measure is most effective in graphs 
in which the maximum independent set is composed of low degree ver- 
tices. It easily follows from known results that some simple algorithms 
for MIS ensure $p \geq 1$. We design a new randomized algorithm for 
MIS that ensures an expected recoverable value of at least $p \geq 7/3$. 
In addition, we show that approximating MIS in graphs with a given 
k-coloring within a ratio larger than 2/k is unique games hard. This 
rules out a natural approach for obtaining p > 2. 

1 Introduction 



The notion of recoverable value was advocated in work of Feige, Immor- 
lica, Mirrokni and Nazerzadeh [5] as a measure of quality for approximation 
algorithms. This notion leads to greater expressive power for stating the 
guarantees of approximation algorithms (compared to the standard notion 
of approximation ratio), thereby leading to greater differentiation among the 
performance guarantees of different algorithms. The hope is that this con- 
cept will lead to the design of new algorithms with superior performance 
with respect to the recoverable value measure (regardless of whether the 
classic approximation ratio differs from that of existing algorithms), and 
moreover, that these algorithms will have better performance in practice (at 
least in some interesting special cases). 

In [5], the term PASS approximation (where PASS is an acronym for 
PArameterized by the Signature of the Solution) was used in order to cap- 
ture the two main features that we wish the recoverable value to have. One 
feature is that the recoverable value is expressed in terms of properties of 
the (unknown) solution, rather than of the input instance. The other is that 
the property of the solution that it refers to is not some aggregate property 
(such as average degree), but rather some signature in which the contribu- 
tion of each individual component of the solution is considered separately. 
Rather than try to present general principles here, let us focus on the prob- 
lem studied in the current paper, that of maximum independent set, and 
specialize the notion of recoverable value to this problem. 

We use the following notation. All graphs in this work are undirected. 
The degree $d(v)$ of vertex v is the number of neighbors of v. The set of 
neighbors of v is $N(v)$. The average degree of a graph is denoted by $d_{avg}$, 
and $d_{avg}(U) = \frac{1}{|U|} \sum_{v \in U} d(v)$ denotes the average over degrees of vertices in 
a set U. An independent set in a graph is a set of vertices I such that every 
two vertices in I are non-neighbors. We shall refer to the problem of finding 
an independent set of maximum cardinality as MIS. The independence 
ratio of $G = (V, E)$ is $\alpha(G) = |I_{max}|/|V|$, where $|I_{max}|$ is the size of a 
maximum independent set. In the maximum weight independent set problem 
MWIS, every vertex v has a nonnegative weight and the goal is to find 
an independent set of maximum weight. 

We now present our notion of recoverable value for MIS. For an independent 
set I, we define its signature to be the sequence of degrees of vertices 
in I. The value that we shall want to recover from each vertex of I depends 
on its degree. With every degree d we associate a recoverable value 
$0 \leq p_d \leq 1$, and wish our approximation algorithms to find an independent 
set of size at least $\sum_{v \in I} p_{d(v)}$. The independent set I is not known to the 
algorithm; it is only used in analyzing its performance measure. Hence this 
performance guarantee holds simultaneously with respect to all independent 
sets in the graph, including the independent set I that happens to maximize 
the expression $\sum_{v \in I} p_{d(v)}$ (this I might not be the maximum independent set 
in the graph). The notion of recoverable value easily generalizes to MWIS, 
by multiplying each term by $w(v)$. For randomized algorithms, one considers 
the expected size (or weight for MWIS) of the independent set that they 
return. 

Intuitively, the smaller the degree of v the more likely algorithms are to 
place v in an independent set (because v excludes a smaller number of other 
vertices), and hence the higher we would like its recoverable value to be. 
Hence it is natural for $p_d$ to be a non-increasing function of d. We find it 
convenient to introduce a parameter p (that we shall attempt to maximize 
later) and to consider the function $p_d = \min[1, p/(d + 1)]$. (Not allowing 
the recoverable value to exceed 1 is necessary, as we cannot find a solution 
larger than the optimal solution.) For graphs of minimum degree at least 
$p - 1$, such approximation algorithms need to find an independent set of size 
at least $\sum_{v \in I} p/(d(v) + 1)$. 

We refer to p in the expression $\min[1, p/(d(v) + 1)]$ as the canonical 
recoverable value (though we sometimes omit the word canonical for brevity). For 
comparison with some previously published algorithms, it is useful to note 
that for every set U of vertices, $\sum_{v \in U} 1/(d_v + 1) \geq |U|/(d_{avg}(U) + 1)$ (with 
equality only if all vertices in U have the same degree). Hence for a given 
value of p, our notion of recoverable value that is based on the signature of 
I (its degree sequence) is more demanding than had we only considered the 
average degree of vertices in I. 
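
To make the comparison between the signature-based bound and the average-degree bound concrete, here is a minimal Python sketch. The adjacency-dictionary representation, the function names, and the small example graph are illustrative assumptions and do not come from the paper.

```python
from fractions import Fraction

def recoverable_target(adj, indep, p):
    """Sum over v in indep of min(1, p / (d(v) + 1)), as an exact fraction."""
    return sum(min(Fraction(1), Fraction(p) / (len(adj[v]) + 1)) for v in indep)

def average_degree_bound(adj, subset):
    """|U| / (d_avg(U) + 1) for a nonempty vertex set U."""
    d_avg = Fraction(sum(len(adj[v]) for v in subset), len(subset))
    return len(subset) / (d_avg + 1)

# Hypothetical example graph: path 0-1-2 with two extra leaves 3 and 4 on vertex 2.
example_adj = {0: {1}, 1: {0, 2}, 2: {1, 3, 4}, 3: {2}, 4: {2}}
example_set = {0, 2}  # an independent set whose vertices have degrees 1 and 3

signature_bound = recoverable_target(example_adj, example_set, 1)  # 1/2 + 1/4
aggregate_bound = average_degree_bound(example_adj, example_set)   # 2 / (2 + 1)
```

On this example the signature-based quantity is 3/4 while the average-degree quantity is 2/3, which illustrates why the signature-based guarantee is the more demanding one.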

1.1 Our results 

It is not hard to see that some known algorithms achieve a canonical 
recoverable value $p \geq 1$. We show (see Proposition 16 in the appendix) that they 
do not achieve a value of p bounded away from 1. 

One can readily observe that $p_0 = p_1 = 1$. We use a procedure that we 
call 2-elimination to show that for MIS one may enforce $p_2 = 1$ as well. 

Thereafter, we design relatively simple new algorithms that achieve a 
recoverable value of $p \geq 2$ for MWIS and $p \geq 7/3$ for MIS. It is NP-hard 
to achieve $p = 4$, as this would imply an exact solution of MIS in 3-regular 
graphs, a problem that is NP-hard and APX-hard [3]. However, in graphs 
in which the minimum degree $\delta$ is large we provide an algorithm achieving 
$p \geq \Omega(\log \delta / \log\log \delta)$ for MWIS. 

Our investigations of the best recoverable value achievable by our 
approaches also lead us to consider MIS in k-colored graphs (graphs for which a 
k-coloring is given). For this problem a 2/k approximation ratio is known [12], 
and we show that improving it would refute the unique games conjecture. 
Our proofs in some cases provide simple alternatives to previously 
published related results. 

1.2 Related work 

The strong inapproximability results for MIS [11] justify searching for other 
measures of performance guarantees, and the notion of recoverable value is 
one such candidate. It considers degrees of vertices, and hence it is instruc- 
tive to recall known approximation algorithms for MIS and MWIS and how 
their performance depends on the degree sequence of the graph. 

Random permutation. Choosing a permutation uniformly at random 
and taking all vertices that appear prior to their neighbors in the order 
induced by the permutation produces an independent set whose expected 
weight is at least $\sum_{v \in V} \frac{w_v}{d(v)+1}$. This guarantees a 
recoverable value of $p \geq 1$. 
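
The random permutation algorithm described above can be sketched in a few lines of Python. This is a minimal illustration under the assumption that the graph is given as a dictionary mapping each vertex to the set of its neighbors; the function name and the optional `rng` parameter are hypothetical.

```python
import random

def random_permutation_independent_set(adj, rng=random):
    """Return the vertices that precede all of their neighbors in a random order."""
    order = list(adj)
    rng.shuffle(order)  # uniformly random permutation of the vertices
    position = {v: i for i, v in enumerate(order)}
    # v is kept exactly when it appears before every one of its neighbors,
    # which happens with probability 1 / (d(v) + 1).
    return {v for v in adj if all(position[v] < position[u] for u in adj[v])}
```

The output is always independent, since of any two adjacent vertices only the earlier one can precede all of its neighbors.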

Greedy. For MIS, iteratively picking a minimum degree vertex, adding 
it to an independent set I and deleting the vertex and its neighbors from the 
graph is guaranteed to find an independent set of size at least $\sum_{v \in V} \frac{1}{d(v)+1}$ 
[6, 16]. Halldorsson and Radhakrishnan [10] showed that this greedy algorithm 
produces an independent set of size at least $|V| \frac{1+\alpha^2}{d_{avg}+1+\alpha}$ (where $\alpha$ denotes 
the fraction of vertices in the maximum independent set). For MWIS, 
weighted greedy that iteratively picks a vertex v with minimum $w_v/(d_v + 1)$ 
is guaranteed to find an independent set of weight at least $\sum_{v \in V} \frac{w_v}{d(v)+1}$. 
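
The unweighted greedy rule above can be sketched as follows. This is a minimal Python illustration that assumes an adjacency-dictionary input; the function name is hypothetical, and ties between minimum degree vertices are broken by dictionary order, which the text does not specify.

```python
def greedy_independent_set(adj):
    """Repeatedly pick a minimum degree vertex and delete it with its neighbors."""
    remaining = {v: set(nbrs) for v, nbrs in adj.items()}  # working copy
    chosen = set()
    while remaining:
        v = min(remaining, key=lambda x: len(remaining[x]))
        chosen.add(v)
        removed = remaining[v] | {v}
        for u in removed:
            del remaining[u]
        for nbrs in remaining.values():
            nbrs -= removed  # degrees are recomputed in the residual graph
    return chosen
```

This straightforward version takes quadratic time in the worst case; it is meant to show the rule, not to be an efficient implementation.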

LP. An integer programming formulation of MWIS is: 

maximize $\sum_{i \in V} w_i x_i$ 

subject to 

$x_i + x_j \leq 1$ for every edge $(i, j)$. 
$x_i \in \{0, 1\}$ for every vertex i. 

Consider the LP relaxation of this program where each $x_i \in [0, 1]$. A 
well known result due to Nemhauser and Trotter [14] asserts that there is 
an optimal solution for the relaxation such that for every i, $x_i \in \{0, 1/2, 1\}$. 
Moreover, such an optimal solution can be found in polynomial time. 

LP+greedy. Consider the following algorithm. Find an optimal half 
integral solution to the LP, discard all the vertices assigned 0, keep all the 
vertices assigned 1, and run the greedy algorithm on the graph induced by 
all vertices that are assigned 1/2. This algorithm was analyzed for connected 
graphs. Hochbaum [12] proved an approximation ratio of $\frac{2}{d_{avg}+1}$, and 
Halldorsson and Radhakrishnan [10] (based on their improved analysis of the 
greedy algorithm) proved an approximation ratio of $\frac{5}{2d_{avg}+3}$. 
SDP. Using semidefinite programming Halldorsson [9] provided approximation 
ratios of $\Omega(\frac{\log d_{avg}}{d_{avg} \log\log d_{avg}})$ for MIS, and $\Omega(\frac{\log \delta}{\delta \log\log \delta})$ for MWIS on 
$\delta$-inductive (a.k.a. $\delta$-degenerate) graphs (in some permutation over the 
vertices $\delta$ is the maximum backward degree). 

Local Search. In graphs of degree bounded by $\Delta$, algorithms based 
on local search were shown to achieve an approximation ratio of roughly 
$5/(\Delta + 3)$, with some distinction between the cases of odd and even $\Delta$ 
(see [3]). We remark that in the special case of $\Delta$-regular graphs, the 
notion of recoverable value becomes equivalent to the traditional notion of 
approximation ratio, and our results are not as strong in this case as those 
achieved by local search. On the other hand, our algorithms are much faster 
than the local search algorithms. 

In terms of hardness results, Austrin, Khot and Safra [2] proved that 
approximating independent set in graphs of maximum degree $\Delta$ within a ratio 
larger than $O(\log^2 \Delta / \Delta)$ is unique games hard. Recall that we present hardness 

results for finding independent sets in graphs where a k-coloring is given. 
They essentially match the 2/k bounds achieved by known approximation 
algorithms [12]. We are not aware of previously published hardness results 
for this problem, but there are some results for related problems on 
hypergraphs [8], and hardness results for MIS in graphs with bounded chromatic 
number but when no coloring is given [7]. 

1.3 Our techniques 

Vertices of very low degree receive special treatment. For MIS we show 
that a certain 2-elimination technique implies $p_2 = 1$. (See Section 2 for 
subtleties involved in the statement of this result.) 

Thereafter, following a recipe suggested in [5] (though there the problem 
considered was different, facility location), we present an algorithm based 
on a so-called recoverable value LP (see Theorem 2). This gives a recoverable 
value of $p = 2$ for MWIS. For MIS we improve over this bound by using 
a new combination of some of the algorithms presented in Section 1.2. (A 
new combination is indeed needed, as we also provide examples showing that 
plain use of these algorithms does not even achieve p bounded away from 1.) 

Given a graph G, choose a random permutation $\pi$ on the vertices. We 
say that vertex v is in layer $L_i$ with respect to $\pi$ if exactly $i - 1$ of v's 
neighbors appear before v in $\pi$. For $k \geq 1$, let $G_k$ be the subgraph of G induced 
on the vertices of the first k layers. The random permutation algorithm 
referred to in Section 1.2 simply returns $G_1$, which is an independent set. Our 
new algorithms will instead consider $G_k$ with a small value of k (depending 
on the context, we shall take $k \in \{2, 3, \delta + 1\}$, where $\delta$ denotes the minimum 
degree), and on $G_k$ run some algorithm from Section 1.2. The advantage 
of our approach (in the context of recoverable value) is that low degree 
vertices are more likely to end up in $G_k$. Moreover, $G_k$ (for small values of 
k) has special structure that makes finding large independent sets easier. 
For example, it turns out that $G_k$ is k-colorable. This suffices for obtaining 
a recoverable value of $p = 2$ (for MWIS on the original G), but does not 
guarantee anything better (by our new hardness of approximation results 
for approximating MIS in k-colored graphs). The fact that $G_k$ is 
$(k - 1)$-inductive allows us to obtain a recoverable value of $p = \Omega(\log \delta / \log\log \delta)$ 
(which improves over $p = 2$ when $\delta$ is large). Our unconditional improvement 
over $p = 2$ uses $G_3$ and applies only to MIS. For $G_3$ we consider its 
average degree, which is no longer a deterministic property but rather a function 
of several random variables (number of vertices in each layer). After 
showing that the $\frac{5}{2d_{avg}+3}$ approximation ratio of [10] extends to disconnected 
graphs (see Theorem 4), we show that the random variables can safely be 
replaced by their expectations (which are deterministic quantities), thereby 
considerably simplifying the analysis. It was previously known that working 
with expectations often simplifies the analysis of randomized algorithms 
(through the use of linearity of expectation), but the reason why it applies 
in our context is more delicate than usual, and may have applications 
elsewhere as well. This leads to a canonical recoverable value $p \geq 15/7$. 
Thereafter we improve to $p \geq 7/3$ by designing a new approximation algorithm 
for MIS in graphs of small average degree. 
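
The layer construction can be illustrated with a short Python sketch. It assumes the graph is a dictionary mapping vertices to neighbor sets and that the permutation is passed as a list; the function names are hypothetical.

```python
def layers(adj, order):
    """Map each vertex to its layer: 1 + the number of its neighbors earlier in `order`."""
    position = {v: i for i, v in enumerate(order)}
    return {v: 1 + sum(1 for u in adj[v] if position[u] < position[v]) for v in adj}

def induced_first_layers(adj, order, k):
    """Adjacency of the subgraph induced on the vertices of the first k layers."""
    layer = layers(adj, order)
    keep = {v for v in adj if layer[v] <= k}
    return {v: adj[v] & keep for v in keep}
```

With k = 1 the returned graph has no edges, matching the random permutation algorithm. For larger k every kept vertex has at most k - 1 earlier neighbors, which is the inductiveness property the text relies on.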



2 Handling low degree vertices 

It is instructive to consider $p_d$ separately for $0 \leq d \leq 3$. 

For isolated vertices, $p_0 = 1$ both for MIS and for MWIS, as they can be 
included in the optimal solution without changing the degree of any other 
vertex. Note an important point here. Our notion of recoverable value 
treats every vertex of I separately. Hence we can remove isolated vertices 
without affecting the remaining vertices in I. This might not have been so 
simple had we considered aggregate properties such as the average degree of 
vertices in I. Removing an isolated vertex increases the average degree for 
the remaining vertices. 

Likewise, $p_1 = 1$. For MIS this follows because every vertex of degree 1 
can be included in the independent set, and its neighbor removed. At most 
one of these two vertices is in any independent set. For other vertices in I, 
this process can only lower their degrees, and hence their recoverable value 
does not decrease (since $p_d$ is non-increasing). For MWIS the argument 
is slightly more delicate. Let u be a vertex of degree 1 and let v be its 
neighbor. If $w_u \geq w_v$ then the same argument as in the unweighted case 
works. However, if $w_u < w_v$ one should remove u from G, and reduce the 
weight of v to $w_v - w_u$, thus obtaining a new graph G'. Any independent 
set I' in G' can be extended to an independent set I in G whose value is 
higher by $w_u$. If $v \in I'$, simply increase the weight of v back to its original 
value. If $v \notin I'$, then include u in I. As the transformation from G to G' 
only removed u and did not increase the degree of any remaining vertex, 
monotonicity of $p_d$ implies that the recoverable value of degree 1 vertices is 1. 

The above arguments also imply that we can remove recursively vertices 
of degree at most 1 with no harm to the recoverable value. Hence we may 
assume that all graphs have minimum degree at least 2. 

We show that for MIS essentially $p_2 = 1$ (see Theorem 1 for an exact 
statement). Consider a graph G of minimum degree 2. Let u be a vertex 
of degree 2, and let v and w be its neighbors. We now describe a process 
that we call 2-elimination. If there is an edge (v, w) we can safely add u to 
the independent set (and remove vertices v and w) because at most one of 
u, v, w is in any independent set, and u can replace any of v and w in an 
independent set. Hence it remains to deal with the case that v and w are 
not neighbors of each other. In this case, remove u from G, merge v and w 
to become a new vertex u' whose neighbors are the original neighbors of v 
and w (except for u that was removed from the graph). This gives a new 
graph G'. Any independent set I' in G' can be extended to an independent 
set I in G whose size is larger by 1. If $u' \in I'$, replace it by v and w. If 
$u' \notin I'$, then include u in I. 
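
A single 2-elimination step, together with the rule for extending I' back to I, can be sketched in Python as follows. The adjacency-dictionary representation, the function name, and the caller-supplied `merged_name` for the new vertex u' are assumptions made for illustration.

```python
def two_eliminate(adj, u, merged_name):
    """Apply one 2-elimination step at a degree-2 vertex u.

    Returns (new_adj, lift), where lift maps an independent set of the new
    graph to an independent set of the original graph that is larger by 1.
    """
    v, w = adj[u]  # the two neighbors of u (their order does not matter)
    g = {x: set(nbrs) for x, nbrs in adj.items()}  # working copy
    if w in g[v]:
        # Triangle case: take u and delete u, v, w.
        for x in (u, v, w):
            del g[x]
        for nbrs in g.values():
            nbrs -= {u, v, w}
        return g, lambda ind: set(ind) | {u}
    # Merge case: v and w become one vertex adjacent to their other neighbors.
    merged_nbrs = (g[v] | g[w]) - {u}
    for x in (u, v, w):
        del g[x]
    for nbrs in g.values():
        nbrs -= {u, v, w}
    g[merged_name] = merged_nbrs
    for x in merged_nbrs:
        g[x].add(merged_name)

    def lift(ind):
        ind = set(ind)
        if merged_name in ind:
            return (ind - {merged_name}) | {v, w}
        return ind | {u}

    return g, lift
```

For example, applying the step to a vertex of a 5-cycle yields a triangle, and lifting any single vertex of that triangle gives an independent set of size 2 in the cycle.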

Now let us analyze how the recoverable value changes when transforming 
from G to G'. We assume here canonical recoverable values. Without loss 
of generality, either $u \in I$ or both v and w are in I. In G', we take an 
independent set I' that is induced by I in a natural way (if $u \in I$ then u is 
simply lost, if $v, w \in I$ then $u' \in I'$). Note that $|I'| = |I| - 1$, hence we would 
like to show that their recoverable values differ by at most 1. If $u \in I$ then 
I' differs from I by the recoverable value of u which is indeed at most 1. If 
$v, w \in I$, then the recoverable values of G and G' differ by 

$\min[1, \frac{p}{d_v + 1}] + \min[1, \frac{p}{d_w + 1}] - \min[1, \frac{p}{d_{u'} + 1}]$    (1) 

Using the facts that $d_v, d_w \geq 2$ and $d_{u'} \leq d_v + d_w - 2$ the value of 
(1) is at most 1 whenever $p \leq 10/3$ (the worst choice of parameters being 
$d_v = d_w = 3$ and $d_{u'} = 4$). Hence if one considers canonical recoverable 
values, then if $p \geq 3$ this implies (by definition) that $p_2 = 1$, and if $p \leq 10/3$ 
then 2-elimination achieves $p_2 = 1$ (without affecting p for vertices of degrees 
larger than 2). 
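
The threshold 10/3 in the bound on expression (1) can be sanity-checked numerically. The sketch below evaluates the expression with exact fractions over a finite range of degrees; the function names and the degree cutoff are assumptions, and a finite scan is only a check, not a proof.

```python
from fractions import Fraction

def capped(p, d):
    """min(1, p / (d + 1)): the canonical recoverable value of a degree-d vertex."""
    return min(Fraction(1), Fraction(p) / (d + 1))

def expression_one(p, d_v, d_w, d_merged):
    """Change in recoverable value when v and w are merged into one vertex."""
    return capped(p, d_v) + capped(p, d_w) - capped(p, d_merged)

def largest_change(p, max_degree=40):
    """Maximum of expression (1) over 2 <= d_v, d_w <= max_degree.

    The merged degree is set to its upper bound d_v + d_w - 2, which makes
    the subtracted term as small as possible.
    """
    return max(expression_one(p, a, b, a + b - 2)
               for a in range(2, max_degree + 1)
               for b in range(2, max_degree + 1))
```

At p = 10/3 the scan never exceeds 1, and the choice $d_v = d_w = 3$ with merged degree 4 gives exactly 1. For a slightly larger p such as 7/2 the same choice already exceeds 1.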

The above discussion establishes: 

Theorem 1 When using canonical recoverable values for MIS then regardless 
of the value of p one may assume that $p_0 = p_1 = p_2 = 1$. 

Theorem 1 cannot be extended to $p_3$. Moreover, there is some fixed $\epsilon > 0$ 
such that $p_3 < 1 - \epsilon$. This follows from the fact that approximating MIS in 
3-regular graphs is APX-hard. 

3 Algorithms for MWIS 

In this section we assume that $G = (V, E)$ has been preprocessed to include 
all isolated vertices in the output independent set. This allows us to simplify 
notation from $\min[1, p/(d(v) + 1)]$ to $p/(d(v) + 1)$. 

Theorem 2 Let $G = (V, E)$ be a weighted graph without isolated vertices. 
There is a polynomial time algorithm for MWIS achieving a recoverable 
value of $p = 2$. Namely, the output of the algorithm is an independent set 
of weight at least $\max_I \sum_{v \in I} \frac{2 w_v}{d(v)+1}$. 

Proof. We present two different polynomial time algorithms that achieve 
the desired bounds. One is based on linear programming. The other is much 
faster, but randomized. 

LP algorithm. Consider the following recoverable value LP (the RV LP). 
$x_i$ is a variable that indicates whether vertex i is in the independent set. 

maximize $\sum_{i \in V} \frac{w_i}{d(i)+1} x_i$ 
subject to 

$x_i + x_j \leq 1$ for every edge $(i, j)$. 
$0 \leq x_i \leq 1$ for every vertex i. 

The indicator vector of any independent set I is a feasible solution to 
the recoverable value LP. Moreover, the value of the LP then would be the 
recoverable value with respect to I (up to a scaling factor of p). Hence the 
optimal value of the LP is at least the desired recoverable value with respect 
to the best independent set I (scaled by 1/p). Treating $\frac{w_i}{d(i)+1}$ as a weight of 
vertex i, the results of [13] imply that this LP has a half-integral optimal 
solution, and moreover that such a solution can be found in polynomial time. 
Include in the independent set all vertices with $x_i = 1$ (getting credit $w_i$ 
which is at least twice the credit $w_i/(d(i) + 1)$ that the LP got for them), 
and discard all vertices with $x_i = 0$ (the LP got no credit for them). Let 
$G_{1/2}$ be the weighted graph induced on all vertices assigned 1/2. Running 
weighted greedy on this graph ensures a solution of value $\sum_{i \in G_{1/2}} w_i/(d(i) + 1)$, 
whereas the LP (by having $x_i = 1/2$) got only half this credit. Hence after 
the rounding we obtain an integral solution of value at least twice that of 
the RV LP, implying $p \geq 2$. 

A fast randomized algorithm. Choose a random permutation and consider 
$G_2$ as in Section 1.3. This graph is a forest (can be verified by orienting 
edges towards earlier vertices in the permutation). As every vertex v within 
I belongs to $G_2$ with probability $\frac{2}{d(v)+1}$, the expected weight of an independent 
set within $G_2$ is at least $\sum_{v \in I} \frac{2 w_v}{d(v)+1}$. MWIS can be solved in forests 
in linear time, proving Theorem 2. ■ 
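
The fast randomized algorithm can be sketched in Python as below. The sketch assumes an adjacency dictionary and a weight dictionary; the function name, the optional `order` argument (useful for reproducible runs), and the particular dynamic program used to solve MWIS on the forest are illustrative choices, since the text only states that forests can be solved in linear time.

```python
import random

def mwis_in_first_two_layers(adj, weight, order=None, rng=random):
    """Build G_2 from a (random) permutation and solve MWIS on it exactly."""
    if order is None:
        order = list(adj)
        rng.shuffle(order)
    position = {v: i for i, v in enumerate(order)}
    earlier = {v: [u for u in adj[v] if position[u] < position[v]] for v in adj}
    keep = [v for v in order if len(earlier[v]) <= 1]  # layers L_1 and L_2
    kept = set(keep)
    # In G_2 each vertex has at most one earlier neighbor: its parent in a forest.
    parent = {v: next((u for u in earlier[v] if u in kept), None) for v in keep}
    take = {v: weight[v] for v in keep}  # best subtree value if v is taken
    skip = {v: 0 for v in keep}          # best subtree value if v is skipped
    for v in reversed(keep):             # children are processed before parents
        p = parent[v]
        if p is not None:
            take[p] += skip[v]
            skip[p] += max(take[v], skip[v])
    chosen = set()
    for v in keep:                       # parents are decided before children
        if parent[v] in chosen:
            continue
        if take[v] >= skip[v]:
            chosen.add(v)
    return chosen
```

Every edge of $G_2$ joins a vertex to its unique earlier neighbor, so the parent pointers describe the whole forest and the two passes over `keep` suffice.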

The bounds $\sum_{v \in V} \frac{w_v}{d(v)+1}$ (achieved by the random permutation algorithm 
of Section 1.2) and $\max_I \sum_{v \in I} 2 w_v/(d(v) + 1)$ of Theorem 2 are 
incomparable. Nevertheless, it is not hard to see that both algorithms of 
Theorem 2 ensure not only the bound $\max_I \sum_{v \in I} 2 w_v/(d(v) + 1)$ claimed 
in the statement of the theorem, but also the bound $\sum_{v \in V} \frac{w_v}{d(v)+1}$. For the 
LP algorithm this is attained by the feasible solution that assigns 1/2 to all 
variables. For the fast randomized algorithm this follows because returning 
the first layer is a legitimate output for it. 

We now improve the recoverable value in the special case when the minimum 
degree $\delta$ in the input graph is sufficiently high. 

Theorem 3 Let $G = (V, E)$ be a weighted graph with minimum degree $\delta$. 
There is a polynomial time algorithm for MWIS achieving a recoverable value 
of $p = \Omega(\log \delta / \log\log \delta)$. 

Proof. Choose a random permutation and consider $G_{\delta+1}$ as defined in 
Section 1.3. This graph is $\delta$-inductive (orienting edges towards earlier 
vertices in the permutation, no vertex has outdegree larger than $\delta$). As 
every vertex v within I belongs to $G_{\delta+1}$ with probability $\frac{\delta+1}{d(v)+1}$ (here we 
used the assumption that $d(v) \geq \delta$), the expected weight of an independent 
set within $G_{\delta+1}$ is at least $\sum_{v \in I} \frac{(\delta+1) w_v}{d(v)+1}$. Run the SDP algorithm from [9] 
with approximation ratio $\Omega(\frac{\log \delta}{\delta \log\log \delta})$ (on $\delta$-inductive graphs) to obtain a 
recoverable value with $p = \Omega(\log \delta / \log\log \delta)$. ■ 
4 Algorithms for MIS 



Here we show that for unweighted graphs there are randomized algorithms 
achieving a recoverable value of p strictly larger than 2. (The value of p 
will be further improved in Section 5 by extending the principles developed 
in the current section.) As in Section 3 we wish to simplify notation from 
$\min[1, p/(d(v) + 1)]$ to $p/(d(v) + 1)$. This requires that the input graph 
$G = (V, E)$ has no vertices of degree less than 2. This can be assumed 
without loss of generality (see Section 2). 

The basic idea of our algorithm is as follows. Pick a random permutation 
over the vertices and consider the graph $G_3$ induced on layers $L_1 \cup L_2 \cup L_3$. 
The expected size of $I \cap G_3$ is now $\sum_{v \in I} \frac{3}{d(v)+1}$. (Here we use the fact that 
the minimum degree in G is 2.) One would expect all three layers $L_1$, $L_2$ 
and $L_3$ to be of equal size. Moreover, every vertex in $L_1$ contributes no edge 
to $G_3$, every vertex in $L_2$ contributes at most one edge to $G_3$, and every 
vertex in $L_3$ contributes at most two edges to $G_3$. Hence one would expect 
the average degree of $G_3$ to be at most 2. Recall that the algorithm of 
[10] (LP+greedy) obtains an approximation ratio of $\frac{5}{2d_{avg}+3}$ (on connected 
graphs). Hence, applying this algorithm to $G_3$ (and using the bounds of [10] 
even though $G_3$ need not be connected), one may hope to attain recoverable 
value of $\frac{5}{7} \sum_{v \in I} \frac{3}{d(v)+1}$. 

Summarizing, we have algorithm PLG (Permute, LP, Greedy). 

1. Pick a random permutation over the vertices and consider the graph 
$G_3$ induced on layers $L_1 \cup L_2 \cup L_3$. 

2. Find a half-integral solution to the LP for MIS on G3. Put the ver- 
tices with value 1 in the independent set and remove the vertices with 
value 0. 

3. Run the greedy algorithm on the subgraph induced on the vertices 
that remain from G3. 

The above argument of why PLG achieves a value of p = 15/7 had two 
gaps in it. One relates to the assumption that G3 is connected (which might 
not hold), and the other to the assumption that sizes of layers are equal to 
their expectations. We shall deal with each one of them separately. 

As noted, the approximation ratio of $\frac{5}{2d_{avg}+3}$ of the algorithm of [10] 
assumes that the graph is connected. Indeed, trying to use this expression 
with $d_{avg} < 1$ (which may well be the case for graphs with isolated vertices), 
one obtains approximation ratios better than 1 which of course cannot be 
true. Nevertheless, when the average degree is at least 2, the following theorem 
shows that the bound of $\frac{5}{2d_{avg}+3}$ does apply, regardless of whether the 
graph is connected or not. Observe that all connected graphs are captured 
by the theorem (either they are trees and then greedy solves them optimally, 
or their average degree is at least 2), and hence our proof can also replace 
the one given in [10] for connected graphs. 

Theorem 4 If G is a graph of average degree at least 2 then MIS can be 
approximated within ratio of $\frac{5}{2d_{avg}+3}$. 

Proof. Recall that by the results of [10], the approximation ratio of the 
greedy algorithm on graphs with average degree $d_{avg}$ and independence ratio 
$\alpha$ is $f(\alpha) = \frac{1+\alpha^2}{\alpha(d_{avg}+1+\alpha)}$. This ratio as a function of $\alpha$ is decreasing in the 
range (0, 1/2]. For $\alpha = 1/2$ we obtain the desired approximation ratio of 
$\frac{5}{2d_{avg}+3}$. Hence to prove Theorem 4 it suffices to show that we can assume 
that $\alpha(G) \leq 1/2$. 

Consider an optimal half integral solution to the standard LP relaxation 
of the MIS problem (as implied by [11]). Let ONE denote the set of variables 
receiving 1, ZERO the set of variables receiving 0, and HALF the set of 
variables receiving 1/2. Let H be the subgraph induced on the vertices 
whose corresponding variables are in HALF. The independence ratio of 
H is at most 1/2, as desired. Necessarily $|ONE| \geq |ZERO|$ (otherwise 
they would both be in HALF). Remove ONE and ZERO and all edges 
connected to them (and thus only H is left), and add instead |ONE| isolated 
edges (one edge for each vertex of ONE), thus obtaining a new graph G'. 
Observe that the size of the maximum independent set does not change, but 
$\alpha(G') \leq 1/2$ as desired. It remains to analyze the average degree of G'. In 
the removal phase $|ONE| + |ZERO|$ vertices and at least |ZERO| edges are 
removed (every vertex in ZERO has an edge to ONE, otherwise it could 
be moved to HALF). Thereafter 2|ONE| vertices and |ONE| edges are 
added. Hence altogether exactly $|ONE| - |ZERO|$ vertices and at most 
$|ONE| - |ZERO|$ edges are added. If the average degree of G is at least 2, 
the average degree of G' cannot be larger than that of G. 

Summarizing, we have shown how to transform G into a new graph G' 
for which $\alpha(G') \leq 1/2$, the average degree of G' is at most that of G, and the 
maximum independent sets in G and G' have the same size. Applying greedy 
to G' is the same as taking ONE into the solution and applying greedy only 
on H, which is precisely the algorithm of [10]. By the properties of G', the 
approximation ratio is at least as desired by Theorem 4. ■ 
Apparently, the result of Theorem 4 can be extended also to $d_{avg} \geq 3/2$ 
(Halldorsson, private communication), though this is not needed in our 
paper. 

Now let us deal with the issue of expectations. Let us first explain the 
problem. We have various information about expectations. Namely, 
$E[|I \cap V(G_3)|] = \sum_{v \in I} 3/(d(v) + 1)$, and $E[|L_1|] = E[|L_2|] = E[|L_3|]$. Moreover, 
the number of edges in $G_3$ is at most $2|L_3| + |L_2|$. If things behave exactly 
as expected, the average degree of $G_3$ is at most 2, Theorem 4 applies and 
we get an independent set of size at least $\frac{5}{7} \sum_{v \in I} 3/(d(v) + 1)$ as desired. However, 
there is also variability in the above random variables, and it can be quite 
large. (The complete bipartite graph $K_{3,n-3}$ illustrates this variability.) With 
some probability the average degree in G3 may be larger than 2, and with 
some probability smaller. Hence the approximation ratio on G3 is a random 
variable. Moreover, when the average degree of G3 is too small, the bounds of 
Theorem 4 no longer hold. Moreover, the size of the maximum independent 
set in G3 might be correlated with the average degree in complicated ways. 

We present here a very simple way to handle all the above complications. 

Theorem 5 The expected size of the independent set found by algorithm PLG 
is at least $\frac{5}{7} \sum_{v \in I} \frac{3}{d(v)+1}$, where I is any independent set in G. 

Proof. Our proof uses two important facts. One is that Theorem 4 applies 
also to disconnected graphs. The other (which the reader may verify) is that 
algorithm PLG when run on a disconnected graph gives the same outcome 
(or more formally, the same probability distribution on outcomes) as that 
when PLG is run on each connected component separately. 

Let $\epsilon > 0$ be any desired level of accuracy with which we want to estimate 
the performance guarantee of PLG. Let G be an arbitrary graph on which 
we run PLG, let n be the number of its vertices, and let I be an independent 
set in G. As a thought experiment, make N disjoint copies of G, where N 
is chosen to be sufficiently large as a function of $\epsilon$ and n. Call the resulting 
graph on nN vertices $G_N$, and let $I_N$ be the independent set composed 
from the copies of I. Run PLG on $G_N$. With respect to PLG, the 
random variables $|L_1|$, $|L_2|$, $|L_3|$, and $|I_N \cap G_3|$ are all concentrated around their 
expectation within relative errors that are $o(\epsilon)$ (by our large choice of N 
and the fact that these random variables are each a sum of N bounded and 
independent random variables, one for each copy of G). Hence on $G_N$, with 
probability that can be made $(1 - \epsilon/2)$ PLG finds an independent set of size 
at least $(1 - \epsilon/2) \frac{5}{7} \sum_{v \in I_N} \frac{3}{d(v)+1}$. Hence in expectation, per copy of G, the 
size of the independent set found is at least $(1 - \epsilon) \frac{5}{7} \sum_{v \in I} \frac{3}{d(v)+1}$. Letting 
$\epsilon$ tend to 0, Theorem 5 is proved. 



5 Improved recoverable values for MIS 

The main point of Theorem 5 is that algorithm PLG has a recoverable value 
with p strictly larger than 2. This value is at least 15/7 (as the theorem 
shows) and at most 3 (for example, on the complete bipartite graph $K_{3,n-3}$), 
and it would be interesting to narrow this gap. In this section we use the 
notion of 2-elimination from Section 2 to design variations on algorithm 
PLG for which we prove a value of p strictly larger than 15/7. 

Section 2 shows that for $p \leq 10/3$, by repeatedly applying 2-elimination 
as long as possible, we may assume that G has minimum degree 3. This 
allows us to replace $G_3$ by $G_4$ in algorithm PLG, capturing a higher fraction 
of vertices of I. (In contrast, if there are vertices of degree 2 in I, their 
fraction in $G_3$ is already 1 and cannot increase in $G_4$.) This gain is offset to 
some extent by the higher average degree of $G_4$ compared to $G_3$, which gives 
a poorer guarantee in the approximation ratio $\frac{5}{2d_{avg}+3}$ of Theorem 4. Still, 
we get a recoverable value of $p \geq 4 \cdot \frac{5}{2 \cdot 3 + 3} = 20/9$. More specifically, for the 
original I in G, vertices of degree at most 2 contribute 1 to the recoverable 
value, and vertices of degree at least 3 contribute at least $\frac{20}{9(d_v + 1)}$. 

Corollary 6 A canonical recoverable value p = 20/9 is achievable for MIS. 
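
The arithmetic behind the values 15/7 and 20/9 can be reproduced with a few lines of Python. The helper names are hypothetical, and the average degree used for $G_k$ is the heuristic one obtained by assuming equal layer sizes, as in the informal argument of Section 4.

```python
from fractions import Fraction

def layered_average_degree(k):
    """Average degree bound for G_k when all k layers have equal size.

    A vertex in layer i contributes at most i - 1 edges, so k equal layers
    give at most 0 + 1 + ... + (k - 1) edges per k vertices.
    """
    return Fraction(2 * sum(range(k)), k)

def lp_greedy_ratio(d_avg):
    """The ratio 5 / (2 * d_avg + 3) of LP+greedy."""
    return Fraction(5) / (2 * Fraction(d_avg) + 3)

def canonical_value(k):
    """k times the LP+greedy ratio at the average degree bound of G_k."""
    return k * lp_greedy_ratio(layered_average_degree(k))

value_g3 = canonical_value(3)  # 3 * 5/7
value_g4 = canonical_value(4)  # 4 * 5/9
```

Here `value_g3` equals 15/7 and `value_g4` equals 20/9, matching Theorem 5 and Corollary 6 respectively.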

We can further improve the recoverable value by improving over the 
performance guarantees of Theorem 4 (which are based on those of [10]) 
for graphs of a given average degree. We use 2-elimination to do so for the 
special case of $d_{avg} = 2$, which captures the graph $G_3$. 

Theorem 7 On graphs with average degree 2 (or less), MIS can be approx- 
imated within a ratio of 7/9. 

Proof. Let $G = (V, E)$ be a graph of average degree 2. We shall have
a sequence of partitions of $V$ into $V_1$ and $V_2$, and consider the graphs $H_1$
induced on $V_1$ and $H_2$ induced on $V_2$. Initially, $V_1 = V$ and $V_2 = \emptyset$. Move
vertices from $V_1$ to $V_2$ by iteratively using any of the following simplifications
(as long as they are applicable to the current graph $H_1$).

0-elimination: move isolated vertices from $V_1$ to $V_2$.

1-elimination: move vertices $u$ of degree 1 and their neighbors from $V_1$
to $V_2$. That is, if $(u,v)$ is an edge in $H_1$ and $u$ has degree 1 in $H_1$, move
both $u$ and $v$ to $V_2$ (and hence the edge $(u,v)$ to $H_2$).

2-elimination: Consider a vertex $v_1$ of degree 2 in $H_1$, incident to
$v_2, v_3 \in V_1$. If $v_2, v_3$ are neighbors, move all three vertices to $V_2$ (inducing
a triangle in $H_2$). Otherwise move $v_1$ and $v_2$ to $V_2$ ($v_2$ is chosen arbitrarily,
$v_3$ would do just as well), leave $v_3$ in $V_1$, but change G (and hence also the
induced graphs $H_1$ and $H_2$) by connecting $v_3$ to those neighbors of $v_2$ that
are still in $V_1$, and disconnecting $v_2$ from these neighbors. Observe that
the edge $(v_1,v_2)$ is now in $H_2$.

NT (for Nemhauser-Trotter): Run the LP for MIS on $H_1$ and find a
half integral optimal solution (as in [14]). If there are vertices with integer
values (0 or 1), move them from $V_1$ to $V_2$.
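
The three combinatorial simplifications can be sketched as follows. This is a minimal illustration and not the exact procedure analyzed here: the NT step is omitted (it needs an LP solver), the graph $H_1$ is assumed to be given as a dictionary from integer vertex labels to neighbor sets, and the function name `simplify` and the step labels are hypothetical.

```python
def simplify(adj):
    """Apply 0-, 1- and 2-elimination to H_1 until none applies.

    adj maps each vertex of H_1 to its set of neighbours and is modified
    in place. Returns the list of elimination steps performed, in order.
    """
    steps = []

    def remove(v):
        # Delete v from H_1 (i.e. move it to V_2).
        for u in adj.pop(v):
            adj[u].discard(v)

    changed = True
    while changed:
        changed = False
        for v in list(adj):
            if v not in adj:
                continue
            deg = len(adj[v])
            if deg == 0:
                remove(v)
                steps.append(("0-elim", v))
            elif deg == 1:
                (u,) = adj[v]
                remove(v)
                remove(u)
                steps.append(("1-elim", v, u))
            elif deg == 2:
                v2, v3 = sorted(adj[v])
                if v3 in adj[v2]:
                    # v, v2, v3 form a triangle: move all three.
                    remove(v)
                    remove(v2)
                    remove(v3)
                    steps.append(("2-elim-triangle", v, v2, v3))
                else:
                    # Move v and v2; v3 inherits the other neighbours of v2.
                    inherited = adj[v2] - {v}
                    remove(v)
                    remove(v2)
                    for w in inherited:
                        adj[w].add(v3)
                        adj[v3].add(w)
                    steps.append(("2-elim-merge", v, v2, v3))
            else:
                continue
            changed = True
    return steps
```

On a 5-cycle, for instance, one merge step turns the remaining three vertices into a triangle, which is then removed by a second 2-elimination; a graph of minimum degree 3 is left unchanged.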

Before continuing to describe our algorithm, let us present some conse- 
quences of the simplifications. 

Lemma 8 $H_2$ has a maximum independent set $I_2$ such that there is no
edge between $I_2$ and $V_1$. Moreover, such an independent set can be found in
polynomial time.

Proof. The proof is by induction on the number of simplification steps.
The base case is when $H_2$ is the empty graph, and then the lemma trivially
holds. Given that the lemma holds for $H_2$, let $H_2'$ be the graph after one
additional simplification step. The inductive hypothesis implies that in $H_2$
one finds a maximum independent set with no neighbors in $V_1$, and hence
no edges to those vertices newly added to $H_2'$. Hence it remains to show
that in the subgraph induced on those vertices newly added to $H_2'$ one can
find a maximum independent set not connected to any vertex of the new
$V_1$. For a 0-elimination, take the new vertex, for a 1-elimination take the
vertex $u$ of degree 1, for a 2-elimination in which a triangle was moved
take $v_1$, for a 2-elimination in which an edge $(v_1,v_2)$ was moved (and other
neighbors of $v_2$ were transferred to $v_3$) take $v_2$, and for NT take those vertices
of value 1. (These 1-vertices cannot have neighbors in the new $V_1$ since
vertices there had fractional value 1/2. Moreover, the 1-vertices form a
maximum independent set on the subgraph induced on vertices of value 0
and 1. Otherwise, there is a set of vertices of value 0 with fewer neighbors of
value 1, but then the value of the LP could be raised by changing all these
values to 1/2.) ■

Let G' be the graph obtained from G by using the above simplifications. 
Observe that G' might not be equal to G because 2-elimination moves endpoints
of edges from one vertex to another. Nevertheless:

Lemma 9 Given any independent set I' in G' one can find in polynomial
time an independent set I of equal size in G, and vice versa.

Proof. The only simplification step that makes G' different from G is a
2-elimination step in which the neighbors $v_2, v_3$ of $v_1$ do not share an edge.
Consider G with the partition $V_1, V_2$ just before the 2-elimination occurs and
G' with the partition $V_1', V_2'$ immediately after the elimination took place.
Let $I$ be an independent set in G. If $I$ contains no vertex from $v_1, v_2, v_3$,
then $I$ is independent also in G'. If $I$ contains one vertex from $v_1, v_2, v_3$, put
$v_2$ in $I'$ (in its place). Lemma 8 implies that we may assume that $I \cap V_2$ has no edges to
$v_2$, and hence $I'$ is indeed an independent set. If $I$ contains two vertices from
$v_1, v_2, v_3$, then they must be $v_2$ and $v_3$, and then $I$ is also an independent
set in G'. Observe that $I$ cannot contain all three $v_1, v_2, v_3$. Conversely, let
$I'$ be an independent set in G'. If $I'$ contains no vertex from $v_1, v_2, v_3$, then
$I'$ is independent also in G. If $I'$ contains one vertex from $v_1, v_2, v_3$, put $v_1$
in $I$ (in its place). Lemma 8 implies that we may assume that $I' \cap V_2$ has no edges to $v_1$,
and hence $I$ is indeed an independent set. If $I'$ contains two vertices from
$v_1, v_2, v_3$, then they must be $v_2$ and $v_3$, and then $I'$ is also an independent
set in G. Observe that $I'$ cannot contain all three $v_1, v_2, v_3$. ■

When none of the above simplifications are applicable, do the following: 

1. Run the greedy algorithm to find an independent set $I_1$ in $H_1$.

2. In $H_2$ find a maximum independent set $I_2 \subseteq V_2$ with no edges to $I_1$,
as in Lemma 8.

3. Use Lemma 9 to output an independent set in G of size $|I_1| + |I_2|$.
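
Step 1 can be illustrated by the following sketch. It assumes that "the greedy algorithm" is the usual minimum-degree greedy (repeatedly pick a vertex of minimum degree in the remaining graph and delete it together with its neighbors); breaking ties by vertex label is an arbitrary choice made here so that the run is deterministic, and the function name is hypothetical.

```python
def greedy_independent_set(adj):
    """Minimum-degree greedy on a graph given as vertex -> set of neighbours."""
    adj = {v: set(ns) for v, ns in adj.items()}  # work on a copy
    chosen = []
    while adj:
        # Vertex of minimum current degree, ties broken by label.
        v = min(adj, key=lambda u: (len(adj[u]), u))
        chosen.append(v)
        # Delete v and all its neighbours from the remaining graph.
        for u in adj[v] | {v}:
            for w in adj.pop(u):
                if w in adj:
                    adj[w].discard(u)
    return chosen
```

The output is always a maximal independent set of the input graph; the analysis that follows only uses a lower bound on its size.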

The above lemmas imply that the algorithm can be run in polynomial 
time. We now analyse the size of the independent set that it finds. 

When all simplification steps end, let $n_1$ and $cn_1$ denote the number of
vertices and edges in $H_1$. Observe that $c \ge 3/2$ because the minimum degree
in $H_1$ is 3. Let $\alpha$ be the independence ratio of $H_1$, and observe that $\alpha \le 1/2$
(otherwise NT could have been applied). The number of vertices in $H_2$ is
$n - n_1$. Since G contains $n$ edges and our simplifications do not increase the
number of edges, the number of edges in $H_2$ is at most $n - cn_1$. As every
graph $G = (V, E)$ has an independent set of size at least $|V| - |E|$ it follows
that the size of an independent set in $H_2$ is at least $(c - 1)n_1$. Since the
average degree in $H_1$ is $2c$ the greedy algorithm finds in $H_1$ an independent
set of size at least $\frac{1+\alpha^2}{2c+1+\alpha} n_1$. It follows that the approximation ratio of our algorithm
is at least $\frac{2c^2+\alpha^2-c+c\alpha-\alpha}{2c^2+\alpha^2-c+3c\alpha-1}$. For $c \ge 3/2$ and $0 \le \alpha \le 1/2$ Lemma 10 implies that
the aforementioned expression is at least 7/9, proving Theorem 7. ■
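
For completeness, the computation behind the bound on the approximation ratio can be spelled out as follows. The derivation is a sketch: it assumes that the greedy guarantee used above is $|I_1| \ge \frac{1+\alpha^2}{2c+1+\alpha} n_1$, it uses $|I_2| \ge (c-1)n_1$, and it bounds the optimum by $\alpha n_1 + |I_2|$. The first inequality uses that $(a+t)/(b+t)$ is nondecreasing in $t \ge 0$ whenever $0 \le a \le b$.

```latex
\begin{align*}
\frac{|I_1| + |I_2|}{\alpha n_1 + |I_2|}
  &\ge \frac{\frac{1+\alpha^2}{2c+1+\alpha}\, n_1 + (c-1)\, n_1}
            {\alpha n_1 + (c-1)\, n_1} \\
  &= \frac{(1+\alpha^2) + (c-1)(2c+1+\alpha)}
          {(\alpha + c - 1)(2c+1+\alpha)} \\
  &= \frac{2c^2 + \alpha^2 - c + c\alpha - \alpha}
          {2c^2 + \alpha^2 - c + 3c\alpha - 1}.
\end{align*}
```

With $x = c$ and $y = \alpha$ this is exactly the expression bounded in Lemma 10.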



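Lemma 10 Let $(x,y) \in [3/2, \infty) \times [0,1/2]$. Then

$$\frac{2x^2+y^2-x+xy-y}{2x^2+y^2-x+3xy-1} \ge \frac{7}{9}.$$

Proof. $\frac{2x^2+y^2-x+xy-y}{2x^2+y^2-x+3xy-1} = 1 + \frac{1-y-2xy}{2x^2+y^2-x+3xy-1}$. Thus it suffices to prove that

$$\frac{1-y-2xy}{2x^2+y^2-x+3xy-1} \ge -\frac{2}{9}$$

for $x$ and $y$ in the above ranges. Since the denominator is positive in these ranges, this is equivalent to $4x^2 + 2y^2 - 2x - 12xy - 9y + 7 \ge 0$.
Let $g(x,y) = 4x^2 + 2y^2 - 2x - 12xy - 9y + 7$. Clearly
$\partial g/\partial x = 8x - 2 - 12y \ge 0$ for such $x, y$ and $\partial g/\partial y = 4y - 12x - 9 \le 0$ for
such $x, y$. Hence $g(x,y) \ge g(3/2, 1/2) = 0$, as desired. ■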
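
As a sanity check on Lemma 10 (not a substitute for the proof), one can evaluate the ratio and the polynomial $g$ exactly on a grid. The sketch below assumes the ratio is $(2x^2+y^2-x+xy-y)/(2x^2+y^2-x+3xy-1)$; the grid bounds and step sizes are arbitrary choices made for illustration.

```python
from fractions import Fraction

def ratio(x, y):
    num = 2 * x * x + y * y - x + x * y - y
    den = 2 * x * x + y * y - x + 3 * x * y - 1
    return num / den

def g(x, y):
    return 4 * x * x + 2 * y * y - 2 * x - 12 * x * y - 9 * y + 7

# x ranges over [3/2, 3/2 + 199/20] and y over [0, 1/2], both endpoints of y included.
grid = [(Fraction(3, 2) + Fraction(i, 20), Fraction(j, 40))
        for i in range(200) for j in range(21)]

worst = min(ratio(x, y) for x, y in grid)
print(worst)  # smallest ratio seen on the grid
```

On this grid the minimum is attained at the corner $(3/2, 1/2)$, where $g$ vanishes, in agreement with the monotonicity argument in the proof.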

Replacing the algorithm of Theorem 4 by the algorithm of Theorem 7 in
algorithm PLG we obtain:

Theorem 11 A canonical recoverable value $\rho = 7/3$ is achievable for MIS.

6 MIS in k-colored graphs

In analyzing algorithm PLG we only used the fact that $G_3$ has small average
degree. However, $G_3$ has other structural properties as well. It is 3-colorable,
and furthermore, the 3-coloring can be found efficiently (e.g., by coloring
the vertices of $G_3$ inductively in the order in which they appear in the
permutation that generated $G_3$). We use the term k-colored graph to denote
a graph with a given k-coloring. The following proposition is known [12].



Proposition 12 MIS on k-colored graphs can be approximated within 2/k. 
Proof. We present two proofs. 

1) Consider all possible pairs of color classes. At least one such pair contains 
at least 2/k fraction of opt. Each such pair is a bipartite graph on which 
MIS can be solved exactly. 

2) Use an LP for MIS which by its half integrality and standard rounding 
techniques allows us to assume that opt is at most half the graph. Thereafter, 
the largest color class provides at least a 2/k approximation. ■ 
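
The first proof can be turned into code along the following lines. This is a sketch under stated assumptions: the graph is a dictionary from vertices to neighbor sets, the coloring is a list of at least two color classes, and MIS on each bipartite pair is solved through a maximum matching and König's theorem (one standard way to solve bipartite MIS exactly; the text does not prescribe a method). The function names are hypothetical.

```python
from itertools import combinations

def bipartite_mis(left, right, adj):
    """Maximum independent set of the bipartite graph induced on left and right."""
    right = set(right)
    match = {}  # matched partner of each matched vertex (both directions)

    def augment(u, seen):
        # Kuhn's augmenting-path search from the left vertex u.
        for w in adj[u] & right:
            if w not in seen:
                seen.add(w)
                if w not in match or augment(match[w], seen):
                    match[u], match[w] = w, u
                    return True
        return False

    for u in left:
        augment(u, set())

    # Koenig: vertices reachable by alternating paths from unmatched left vertices.
    reach = set()
    stack = [u for u in left if u not in match]
    while stack:
        u = stack.pop()
        if u in reach:
            continue
        reach.add(u)
        for w in adj[u] & right:
            if w not in reach:
                reach.add(w)
                stack.append(match[w])  # w is matched, since the matching is maximum
    return [u for u in left if u in reach] + [w for w in right if w not in reach]

def two_class_approx(adj, colour_classes):
    """Best exact solution over all pairs of colour classes (needs k >= 2)."""
    best = []
    for a, b in combinations(colour_classes, 2):
        cand = bipartite_mis(list(a), list(b), adj)
        if len(cand) > len(best):
            best = cand
    return best
```

For a 6-cycle with the 3-coloring {0,3}, {1,4}, {2,5}, every pair of classes induces two disjoint edges, so the procedure returns an independent set of size 2, which is exactly a 2/3 fraction of the optimum 3.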

One may hope that Proposition 12 can be improved, for example, by
combining the two different proofs given to it. If so, this would give an
approximation ratio better than 2/3 for $G_3$, hence potentially replacing or
even surpassing the 5/7 bound that we used for $G_3$ based on Theorem 4.

We show that Proposition 12 is nearly tight, unless vertex cover can
be approximated within a ratio better than 2. (In particular, this implies
UGC-hardness, since hardness of unique games is a stronger assumption
than inapproximability of VC beyond a ratio of 2, see [13].)

Let G be an n-vertex graph in which one wants to approximate VC
within a ratio better than 2. As is well known (e.g., [10]), this is the same as
distinguishing for some $\epsilon > 0$ whether $\alpha(G)$, the size of MIS in G, satisfies
$\alpha(G) \ge (1/2 - \epsilon)n$ or $\alpha(G) \le \epsilon n$.

For $k > 2$ construct from G a k-colored graph G' as follows. For every
vertex $v \in G$, the graph G' contains $k$ copies $v_1, \ldots, v_k$. All vertices with
the same index $i$ form an independent set (hence a color class). Between
any two distinct color classes other than class $k$, place a bipartite graph
mimicking the edge pattern of G, connecting vertex $v_i$ with vertex $u_j$ (where
$k \ne j \ne i \ne k$) if $(v,u)$ (or $(u,v)$) is an edge in G. Each vertex $v_k$ of the
kth class is connected only to its own copies $v_i$ in the other classes. Hence
the bipartite graph between class $k$ and any other class is simply a perfect
matching.

Lemma 13 In the reduction above, $\alpha(G') = n + (k - 2)\alpha(G)$.

This gives a gap of $\frac{n + O(\epsilon kn)}{kn/2 - O(\epsilon kn)} = \frac{2}{k} + O(\epsilon)$ between no and yes instances.
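
The reduction and the identity of Lemma 13 can be checked by brute force on tiny instances. The sketch below is an illustration only: vertices of G are assumed to be 0, ..., n-1, a copy $v_i$ is represented by the pair (v, i), and the function names are hypothetical. The exhaustive search is exponential and meant for very small graphs.

```python
from itertools import combinations

def graph_from_edges(n, edges):
    adj = {v: set() for v in range(n)}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj

def build_coloured_graph(n, edges, k):
    """The k-coloured graph G'; colour class i consists of the pairs (v, i)."""
    adj = {(v, i): set() for v in range(n) for i in range(1, k + 1)}

    def connect(a, b):
        adj[a].add(b)
        adj[b].add(a)

    # Classes 1..k-1: copy the edge pattern of G between distinct classes.
    for u, v in edges:
        for i in range(1, k):
            for j in range(1, k):
                if i != j:
                    connect((u, i), (v, j))
    # Class k: each v_k is joined only to its own copies v_1, ..., v_{k-1}.
    for v in range(n):
        for i in range(1, k):
            connect((v, k), (v, i))
    return adj

def alpha(adj):
    """Size of a maximum independent set, by exhaustive search."""
    verts = list(adj)
    for size in range(len(verts), 0, -1):
        for cand in combinations(verts, size):
            s = set(cand)
            if all(adj[v].isdisjoint(s) for v in cand):
                return size
    return 0
```

For the path on four vertices (where $\alpha(G) = 2$) and $k = 3$, the construction yields a graph on 12 vertices whose independence number is $4 + (3-2) \cdot 2 = 6$.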

To prove Lemma 13 we shall use the following lemma.

Lemma 14 There is a MIS I' in G' such that for every vertex $v \in G$, either
$v_i \in I'$ for all $i \ne k$, or $v_k \in I'$.

Proof. Consider an arbitrary independent set I' in G'. If either no copy or
only one copy of $v$ is in I', then without loss of size of I' we may take this
to be $v_k$. If at least two copies of $v$ are in I' then neither one of them can
be $v_k$. But then, we can add all other $v_i$ with $i \ne k$ to I', since none of them
can be a neighbor of a vertex already in I'. ■

Lemma 14 implies that all vertices $v \in G$ for which $v_k \notin I'$ form an
independent set in G. Lemma 13 easily follows. Applying [13] and Lemma 13 we
immediately obtain:

Theorem 15 Let $k$ be an integer greater than 2. Assume that for every
$\epsilon > 0$ it is NP-hard to distinguish between graphs with independent set of
size at least $(1/2 - \epsilon)n$ and graphs with independent sets of size at most $\epsilon n$.
Then, for arbitrary $\delta > 0$ it is NP-hard to approximate MIS on k-colored
graphs within a factor larger than $\frac{2}{k} + \delta$. In particular, approximating MIS
on k-colored graphs within a factor larger than $\frac{2}{k} + \delta$ is unique-games hard.

Recall that a vertex cover is the complement of an independent set. 
For k-colored graphs, vertex cover can be approximated within $2 - 2/k$.
Theorem 15 implies (under the assumptions of the theorem) that this is
best possible, up to low order terms.

Note that G' constructed in the proof of Theorem 15 has additional
structural properties beyond being k-colorable (many symmetries and an
especially simple pattern of connections with color class $k$). This excludes
better than $2/k$ approximation ratios even on more restricted families of
graphs than those with a k-coloring. One such family is that of graphs composed
of one part that is a tree, another that is an independent set, and
edges between these two parts. These graphs are 3-colorable, but in addi- 
tion, the pattern of edges between two of the color classes is very simple. 
Our proof implies that approximating MIS within a ratio better than 2/3 is 
hard on these graphs. Such graphs actually arise in a variation of the algo- 
rithm PLG, if instead of the first three layers one takes the first two layers, 
forming the part that is a tree, and the first layer in the reverse order of 
the permutation, forming the part that is an independent set. Note however 
that the graph $G_k$ used in algorithm PLG is k-colorable due to a structural
property of being $(k-1)$-degenerate, whereas the graph G' constructed in the
proof of Theorem 15 is not necessarily $(k-1)$-degenerate. MIS is APX-hard
even on 2-degenerate graphs (this can be proved by removing one vertex
from a 3-regular graph), but it appears possible to obtain algorithms that
approximate MIS on such graphs within a ratio better than 2/3. This may 
potentially lead to higher recoverable values than those shown in this work. 

7 Discussion 

We presented algorithms that achieve a recoverable value with $\rho = 2$ for
MWIS and $\rho = 7/3 > 2$ for MIS. We also showed that previous algorithms
(such as greedy and greedy+LP) fail to achieve a recoverable value with
$\rho$ bounded away from 1. Finally, we proved a hardness of approximation
result for MIS on graphs with a given k-coloring. Some questions remain
open, most notably, how much higher can $\rho$ be pushed (recall that it cannot
reach 4), and whether there truly is a gap between MIS and MWIS with
respect to recoverable value.

Let us provide some comments on our permutation based algorithms.
They are not guaranteed to produce a maximal independent set. After 
running them it may well be that there remain vertices in G (rather than 
in Gk) not connected to the independent set that they find. This forms a 
residual graph. Hence if one wants to run our permutation based algorithms 
in practice, one can improve their performance by extracting an independent 
set also from the residual graph. We have not attempted to analyze the effect 
of this on the recoverable value. 

The notion of recoverable value and the associated algorithms are most 
effective for graphs in which the vertices in the maximum independent set 
have small degree compared to the average degree in the graph. For some 
other classes of graphs (notably, regular graphs [3]) our performance guarantees
are poorer than those given by other algorithms.

A Some negative examples

We show that LP+greedy (mentioned in Section 1.2) does not meet the
performance guarantees of Theorem 2.

Proposition 16 For arbitrarily large integers $k, d$ there are graphs such
that neither greedy nor LP+greedy is guaranteed to find an independent set of size larger than $k + 1$,
whereas the recoverable value is at least $\rho \frac{dk}{d+1}$. Hence they cannot guarantee
a recoverable value with $\rho$ bounded away from 1.

Proof. Consider the following example. There are three layers. The first
layer consists of $k$ vertices, each with $d$ distinct neighbors in the second
layer. Every one of the $dk$ second layer vertices has $d-1$ distinct neighbors
in the third layer. All $dk(d-1)$ vertices of the third layer form a clique. The
second layer is an independent set with $dk$ vertices and recoverable value of
$dk \cdot \frac{\rho}{d+1}$. A greedy algorithm might choose the $k$ vertices in the first layer and
a clique vertex for a value of $k + 1$. This applies also to greedy+LP because
the optimal half integral solution (to the standard LP, not the recoverable
value LP) of this instance will assign 1/2 to all vertices, provided that $d$ and
$k$ are large enough. (No third layer vertex will be assigned the value 1 since
then all other third layer vertices must be assigned 0. No second layer vertex will
be assigned 1 because this forces $d-1$ distinct third layer vertices to be
assigned 0 rather than 1/2. No first layer vertex will be assigned 1 because
this forces $d$ distinct second layer vertices to be assigned 0 rather than 1/2.)
■
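
The three-layer example can be reproduced on small parameters. The sketch below is an illustration under stated assumptions: greedy is taken to be minimum-degree greedy, and ties are deliberately broken in favor of first layer vertices (the adversarial choice that the proof allows). Vertex names such as ("a", i) and the function names are hypothetical.

```python
def three_layer_graph(k, d):
    """First layer ("a", i), second layer ("b", i, j), third layer ("c", i, j, t)."""
    adj = {}

    def connect(a, b):
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)

    third = []
    for i in range(k):
        for j in range(d):
            connect(("a", i), ("b", i, j))
            for t in range(d - 1):
                connect(("b", i, j), ("c", i, j, t))
                third.append(("c", i, j, t))
    # The third layer forms a clique.
    for x in range(len(third)):
        for y in range(x + 1, len(third)):
            connect(third[x], third[y])
    return adj

def greedy_first_layer_bias(adj):
    """Minimum-degree greedy; ties go to the smallest name, so "a" beats "b"."""
    adj = {v: set(ns) for v, ns in adj.items()}
    chosen = []
    while adj:
        v = min(adj, key=lambda u: (len(adj[u]), u))
        chosen.append(v)
        for u in adj[v] | {v}:
            for w in adj.pop(u):
                if w in adj:
                    adj[w].discard(u)
    return chosen

k, d = 2, 3
graph = three_layer_graph(k, d)
found = greedy_first_layer_bias(graph)
second_layer = [v for v in graph if v[0] == "b"]
print(len(found), len(second_layer))
```

With $k = 2$ and $d = 3$ the greedy run picks the two first layer vertices and one clique vertex, while the second layer is an independent set of size $dk = 6$.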

We remark that the example in Proposition 16 (and also simpler examples
suffice) shows that the fast algorithm of Theorem 2 does not offer an
expected value of $\rho$ bounded away from 2. The expected number of vertices
in the first two layers can be easily seen to be less than $2(k + 1)$ (by using
linearity of expectation to sum over individual vertices), and hence the expected
size of the independent set found will not exceed $2(k + 1)$ (which by
increasing $d$ and $k$ can be made to imply $\rho$ arbitrarily close to 2).

The following graph shows that the RV LP together with our rounding
procedure cannot give a value of $\rho > 2$ for MWIS. Consider a graph
composed of a clique of size $k$ connected via a complete bipartite graph to an
independent set of size $k$. Vertices of the clique have weight $2k/(k + 1)$ and
other vertices have weight 1. The weight of the maximum weight independent
set is $k$ and its recoverable value is $\rho k/(k + 1)$. The RV LP has an optimal
solution that assigns value 1/2 to all vertices. Thereafter, weighted greedy
may choose a clique vertex in its first step, thus discarding all other vertices
and finding a solution of weight $2k/(k + 1)$. Hence $\rho = 2$.